Corpus: mar-in_web_2015_300K

4.4.1.5 Number of Word-N-grams at Sentence Endings

Number of distinct word N-grams (N = 1 to 5) at sentence endings, counted over the first K sentences

K          # of words   # of bigrams   # of trigrams   # of 4-grams   # of 5-grams
100                57             88              94             95             96
1000              374            860             959            984            992
10000            1873           6611            8767           9409           9579
100000          12277          56758           89515          97492          98918
1000000         24911         136902          249642         287256         295786
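
As a rough illustration of how counts like those in the table above could be obtained, the following Python sketch collects the last N words of each of the first K sentences and counts the distinct N-grams seen. It is a minimal sketch under stated assumptions, not the procedure used to build this page: the function name `ending_ngram_counts`, the whitespace tokenization, and the rule that sentences shorter than N words are skipped for that N are all hypothetical choices.

```python
def ending_ngram_counts(sentences, k, max_n=5):
    """Count distinct sentence-final word N-grams in the first k sentences.

    Assumptions (not taken from the corpus page): tokens are split on
    whitespace, and a sentence with fewer than n tokens contributes
    nothing to the n-gram count.
    """
    # One set of distinct ending N-grams per N.
    seen = {n: set() for n in range(1, max_n + 1)}
    for sentence in sentences[:k]:
        tokens = sentence.split()
        for n in range(1, max_n + 1):
            if len(tokens) >= n:
                # The last n tokens form the sentence-ending N-gram.
                seen[n].add(tuple(tokens[-n:]))
    return [len(seen[n]) for n in range(1, max_n + 1)]


if __name__ == "__main__":
    # Hypothetical toy input; real input would be one corpus sentence per item.
    toy = ["a b c", "x b c", "c"]
    print(ending_ngram_counts(toy, k=3, max_n=3))
```

Because each sentence contributes at most one ending N-gram per N, every count is bounded by K, which matches the pattern in the table: the counts approach K as N grows.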


Zipf's diagram for sentence endings


Gnuplot diagram
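
A Zipf diagram plots frequency against rank, usually on log-log axes. The sketch below shows one way the underlying rank-frequency data could be derived. It assumes the diagram is based on the final word of each sentence and on whitespace tokenization; neither is confirmed by this page, and the function name `ending_rank_frequency` is hypothetical.

```python
from collections import Counter


def ending_rank_frequency(sentences):
    """Return (rank, frequency) pairs for sentence-final words.

    Assumption: the final word is the last whitespace-separated token.
    Empty sentences are ignored.
    """
    counts = Counter()
    for sentence in sentences:
        tokens = sentence.split()
        if tokens:
            counts[tokens[-1]] += 1
    # most_common() sorts by descending frequency, so rank 1 is the
    # most frequent sentence-final word.
    return [(rank, freq)
            for rank, (_, freq) in enumerate(counts.most_common(), start=1)]


if __name__ == "__main__":
    # Hypothetical toy input.
    for rank, freq in ending_rank_frequency(["a b", "c b", "d e"]):
        print(rank, freq)
```

The printed two-column output (rank, frequency) is in a form that a plotting tool such as Gnuplot can read directly and draw with logarithmic axes.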

29566 msec needed at 2018-05-24 21:53